| 1. | The chinese automatic word segmentation is an important part in the chinese information processing 汉语自动分词是中文信息处理中的重要环节。 |
| 2. | The present paper analyzes such problems as the validity , effect , precision , tolerance , compulsion and limit of automatic word segmentation 机器分词时会遇到分词的正确性、加工精度的可容性、机器分词的强制性、机器分词的局限性等问题。 |
| 3. | Nowadays , research on chinese information processing focuses on chinese automatic word segmentation , parsing , but seldom in automatic term extraction 目前,国内对中文信息处理的研究主要集中在汉语自动分词、语法分析上,对术语自动抽取的研究还不是很多。 |
| 4. | Besides these , the model of the chinese automatic words segmentation describedin this dissertation can be used to deal with the words segmentation in the situation of command lines 另外,本文所描述的汉语自动分词模块已可以在基于命令行的情况下,进行分词处理。 |
| 5. | The way to segmentation and the anticipated functional criterion that are suited to this subject are illustrated , at last the concrete design of the chinese automatic words segmentation are described , including the overall design and the design of each model 最后详细描述了汉语自动分词模块的具体设计,包括总体设计以及各模块设计等,同时给出了一些关键性的例程说明和程序设计的关键点总结。 |
| 6. | The application of artificial neural network to solve the problem of chinese automatic word segmentation is presented . the mapping model and its performance are studied . based on a number of experiments , the performance of the model is evaluated 神经网络分词是今后分词技术发展的一个趋势,本文对分词神经网络进行了研究,建立了分词神经网络的实验系统,利用分词神经网络进行了歧义字段划分的实验。 |
| 7. | The automatic and accurate identification of chinese organization names is very significant to improve the accuracy of automatic word segmentation , and it will establish a good foundation for natural language comprehension , machine translation , information extraction and information retrieval 中文机构名称的自动识别对提高汉语自动分词的精确率有着重要的意义,也是自然语言理解、机器翻译、信息抽取和信息检索的基础。 |
| 8. | Refer to chinese automatic word segmentation based on statistics , this paper imports the mechanism of open learning , and uses the method of supervised and unsupervised learning . the word segmentation model includes credibility revising and partial tri - gram information 本文在基于统计的汉语自动分词的基础上,引入开放学习机制,通过有监督和无监督相结合的学习方法,建立包含可信度修正和部分三元语法信息的多元分词模型。 |
| 9. | Chinese information processing model is added to the traditional search engine , which can make search engine intelligent and personalized . chinese automatic word segmentation is the first work in chinese information processing . in this paper , a chinese word segmentation system is studied , which fits for intelligence search engine 针对歧义字段的划分问题,提出了歧义字段划分的三个原则,在三原则的基础上给出了“二字续分法”分词的方案,该方案能够快速有效的分解大部分的歧义字段,具有很高的实用价值。 |
| 10. | Chinese automatic word segmentation is the fundamental task of the chinese information processing . it mainly comprises of three difficult questions , including word criterion , disambiguation , unknown word identifying . many researchers have contributed to this field , but in the present days , it still needs pursuing higher precision 汉语自动分词是中文信息处理领域的基础课题,而且也是进行其它中文信息处理的前提,它有三个主要难点分别是分词规范,歧义字段切分和未登录词,国内外许多研究人员在这一领域都进行了深入的研究,但就目前现状来看,分词的正确率仍然有提升的空间。 |